Skip to content

Create perplexity-lens.md #32

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

iamaayushijain
Copy link

@iamaayushijain iamaayushijain commented Aug 5, 2025

Description

Brief description of your contribution

Type of Contribution

  • Example Tutorial
  • Showcase Project
  • Article/Integration Guide
  • Documentation Update
  • Bug Fix
  • Other (please describe)

Checklist

  • My code follows the cookbook's style guidelines
  • I have included comprehensive documentation
  • I have tested my code and it works as expected
  • I have included all necessary dependencies and setup instructions
  • My MDX file includes proper frontmatter (title, description, keywords)
  • I have linked to any external repositories or live demos

Project Details

What problem does this solve?
Many developers struggle with extracting meaningful, structured data from long-form PDFs—especially ones with inconsistent formatting, multi-column layouts, or embedded tables. This project solves that by providing an offline, lightweight PDF parsing toolkit that can extract structured text, headings (H1-H3), and tables into a clean JSON format.

What makes this contribution valuable to other developers?
This toolkit uses multiple parsing libraries to maximize accuracy and coverage, allowing developers to skip over the tedious process of manually extracting content from PDFs. It is especially useful for researchers, technical writers, and anyone building search or summarization features over document data. Being offline-first makes it reliable for sensitive or restricted environments.

GitHub Repository
Live Demo (View Shared Graph)

External Links (if applicable):

Testing

Smart Text Selection: Selected various text samples on different web pages to check if AI-generated explanations were provided accurately and quickly.

Webpage Summarization: Used the “Summarize” feature on diverse sites (articles, blogs, documentation) to verify concise and relevant summaries.

Retrieval-Augmented Insights (RAG): Hovered and clicked on words/phrases for context retrieval and ensured that RAG-based results were accurate and contextually relevant.

Knowledge Graph Visualization: Added multiple concepts, navigating, zooming, and dragging nodes to confirm the D3.js graph responded smoothly and displayed correct connections.

Public Sharing: Generated and accessed shared graph URLs to validate that public sharing worked as intended, without exposing private data.

Screenshots (if applicable)

Screenshot 2025-08-05 at 12 03 10 PM Screenshot 2025-08-05 at 12 03 45 PM Screenshot 2025-08-05 at 12 04 58 PM

Additional Notes

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant